Line stylization through graphics processor unit (gpu) textures

ABSTRACT

A method, apparatus, system, and computer program product provide the ability to render a line having line stylization/linetype pattern via texture mapping of a graphics processing unit (GPU). Linetype information for a pattern of a linetype for the line is acquired. The pattern is stored in a texture by encoding a type of element of the pattern and a texel center location. The GPU renders the line by computing a distance between a pixel of the line and the texel center location, determining if the distance exceeds a threshold, and rendering the pixel if the distance is within the threshold.

CROSS-REFERENCE TO RELATED APPLICATIONS

This application is a continuation under 35 U.S.C. § 120 of applicationSer. No. 14/798,165 (corresponding to Attorney Docket No.:30566.518-US-U1), filed on Jul. 13, 2015, with inventor(s) Sean P.James, David Timothy Rudolf, and Ravinder Patnam Krishnaswamy, entitled“Line Stylization Through Graphics Processor Unit (GPU) Textures,” whichapplication is incorporated by reference herein, and which applicationclaims the benefit under 35 U.S.C. Section 119(e) of the followingco-pending and commonly-assigned U.S. provisional patent application(s),which is/are incorporated by reference herein: Provisional ApplicationSer. No. 62/023,659, filed on Jul. 11, 2014, by Sean P. James, DavidTimothy Rudolf, and Ravinder Patnam Krishnaswamy, entitled “GPUIMPLEMENTATION OF ZOOM DEPENDENT LINE TYPES,” attorneys' docket number30566.518-US-P1.

BACKGROUND OF THE INVENTION 1. Field of the Invention

The present invention relates generally to computer-aided design (CAD)applications, and in particular, to a method, system, apparatus, andarticle of manufacture for stylizing lines using graphics processor unit(GPU) textures.

2. Description of the Related Art

Providing linetypes for two-dimensional (2D) CAD geometry is a corefunctionality in CAD applications. However, 2D CAD geometry has been amostly neglected problem. In prior art 2D products, the problem wasviewed as a central processing unit (CPU) problem, solved bytessellating the geometry into many line segments based on thelinetypes. This means that extra data needs to be sent to the GPU forrendering, which reduces performance. A single line could be broken upinto dozens or even thousands of line segments. While there have beenGPU techniques for rendering continuous curves analytically, e.g. Beziercurves, the application of the GPU for basic 2D type drafting patternsand geometry has not been attempted in the prior art. In this regard,providing very high quality 2D line data with linetypes that conform tostandards in a highly performance solution has not been possible in theprior art.

SUMMARY OF THE INVENTION

Embodiments of the invention leverage the power of the graphicsprocessing unit (GPU) for efficient rendering of linetypes,specifically, the use of the texture as a mechanism to encode linetypepattern definitions. Embodiments of the invention use a two (2) channeltexture to accomplish drawing ‘dots’ for CAD linetypes so they appear asdots at any zoom level. Instead of using textures for color information,texture parameter data about the geometry of the linetype (specificallythe texture parameter at the midline of a texel) is stored. Thus, for‘dots’ in a pattern, to handle zoom—the numeric value in the ‘color’ ofthe texture data is used to acquire an estimate of the pixel distancefrom the analytic ‘dot’. Checks are then performed using the pixelshader of the GPU. This technique enables the ability to send a singleline to the GPU, and have the pixel shader effectively break that lineinto many segments that make up the linetype, thus achieving the samerendering with much increased performance.

BRIEF DESCRIPTION OF THE DRAWINGS

Referring now to the drawings in which like reference numbers representcorresponding parts throughout:

FIG. 1 is an exemplary hardware and software environment used toimplement one or more embodiments of the invention;

FIG. 2 schematically illustrates a typical distributed computer systemusing a network to connect client computers to server computers inaccordance with one or more embodiments of the invention;

FIG. 3 illustrates the typical tessellation for a simple “faceplate”part showing some “jaggies” or aliasing in the prior art;

FIG. 4 illustrates the results of using analytic rendering (i.e., of thesame faceplate as that of FIG. 3) from within a pixel shader of a GPU inaccordance with one or more embodiments of the invention;

FIG. 5 illustrates the processing performed by the CPU and GPU for linesin accordance with one or more embodiments of the invention;

FIG. 6 illustrates a bell-curve that is used for anti-aliasing inaccordance with one or more embodiments of the invention;

FIG. 7 illustrates a tessellated ellipse created in accordance with theprior art;

FIG. 8 illustrates an analytic ellipse rendered through shaders inaccordance with one or more embodiments of the invention;

FIG. 9 illustrates a dialog window with an exemplary listing oflinetypes/linetype names, linetype appearances, and linetypedescriptions in accordance with one or more embodiments of theinvention;

FIG. 10 illustrates a DirectX™ (DX™) resource viewer view of thelinetypes identified in FIG. 9 in accordance with one or moreembodiments of the invention;

FIG. 11 illustrates a graph of a solution of elliptical integrals solvedusing approximation techniques in accordance with one or moreembodiments of the invention;

FIG. 12 illustrates linetypes that follow a curve in accordance with oneor more embodiments of the invention;

FIG. 13 illustrates zoom invariants in accordance with one or moreembodiments of the invention;

FIG. 14 illustrates zoom variants in accordance with one or moreembodiments of the invention;

FIG. 15 illustrates bilinear texture sampling where there is fuzzing atthe ends of the dashes in accordance with one or more embodiments of theinvention;

FIG. 16 illustrates the results of point sampling to avoid fuzzing inaccordance with one or more embodiments of the invention;

FIG. 17 illustrates a curve with a linetype pattern of a dash and twodots in accordance with one or more embodiments of the invention;

FIG. 18 illustrates a zoomed-in view of the linetype pattern of FIG. 17with dots remaining the same size while the dashes increase in length inaccordance with one or more embodiments of the invention;

FIG. 19 illustrates the need to select a subset of pixels (to turn on)if zoomed in sufficiently, where more than one pixel maps to a texel inaccordance with one or more embodiments of the invention; and

FIG. 20 illustrates the logical flow for rendering a line having linestylization/a linetype pattern via the texture mapping functionality ofa graphics processing unit (GPU).

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS

In the following description, reference is made to the accompanyingdrawings which form a part hereof, and which is shown, by way ofillustration, several embodiments of the present invention. It isunderstood that other embodiments may be utilized and structural changesmay be made without departing from the scope of the present invention.

Overview

Basic linetypes in CAD consist of dashes, spaces, and dots, withspecifications in terms of relative length of dashes and spaces andlocations of dots. Embodiments of the provide a technique to render alinetype that has a simultaneous zoom dependent and zoom independentpiece by using the texture to store information.

Unique aspects of embodiments of the invention include:

(1) Using a texture to draw a linetype and using a cumulative lengthbased parameterization to map the line pattern using a texture buffer;and

(2) Storing texel parameter information in the texture itself in orderto be able to draw a dot by interpreting the texture value, normallyused as a color or color component, as a reference texture coordinate.

Embodiments of the invention provide the ability to encode textureinformation for a linetype that includes a dot and a dash, where thedash size depends on zoom level but the dot size is fixed at all zoomlevels. Such encoding is done by using a texture channel to storeinformation about the pattern—the dash/dot type, and the texelparameter.

Additional embodiments of the invention may use a similar technique torender hatch patterns.

Hardware Environment

FIG. 1 is an exemplary hardware and software environment 100 used toimplement one or more embodiments of the invention. The hardware andsoftware environment includes a computer 102 and may includeperipherals. Computer 102 may be a user/client computer, servercomputer, or may be a database computer. The computer 102 comprises acentral processing unit (CPU) 104A (also referred to as a generalpurpose hardware processor) and/or a graphics processing unit (GPU) 104B(also referred to as a special purpose hardware processor) and a memory106, such as random access memory (RAM). The computer 102 may be coupledto, and/or integrated with, other devices, including input/output (I/O)devices such as a keyboard 114, a cursor control device 116 (e.g., amouse, a pointing device, pen and tablet, touch screen, multi-touchdevice, etc.) and a printer 128. In one or more embodiments, computer102 may be coupled to, or may comprise, a portable or mediaviewing/listening device 132 (e.g., an MP3 player, iPod™, Nook™,portable digital video player, cellular device, personal digitalassistant, etc.). In yet another embodiment, the computer 102 maycomprise a multi-touch device, mobile phone, gaming system, internetenabled television, television set top box, or other internet enableddevice executing on various platforms and operating systems.

In one embodiment, the computer 102 operates by the general purposeprocessor 104A performing instructions defined by the computer program110 under control of an operating system 108. The computer program 110and/or the operating system 108 may be stored in the memory 106 and mayinterface with the user and/or other devices to accept input andcommands and, based on such input and commands and the instructionsdefined by the computer program 110 and operating system 108, to provideoutput and results. The GPU 104B is configured to rapidly manipulate andalter memory to accelerate the creation of images in a frame bufferintended for output to a display. In particular, GPUs 104B are veryefficient at manipulating computer graphics and image processing in ahighly parallel structure that makes them more effective than generalpurpose CPUs 104A when processing large blocks of data in parallel. AGPU 104B may be present on a video card, or can be embedded on amotherboard, or may be part of a mobile device (e.g., a cellular phone).

Output/results may be presented on the display 122 or provided toanother device for presentation or further processing or action. In oneembodiment, the display 122 comprises a liquid crystal display (LCD)having a plurality of separately addressable liquid crystals.Alternatively, the display 122 may comprise a light emitting diode (LED)display having clusters of red, green and blue diodes driven together toform full-color pixels. Each liquid crystal or pixel of the display 122changes to an opaque or translucent state to form a part of the image onthe display in response to the data or information generated by theprocessor 104A/104B from the application of the instructions of thecomputer program 110 and/or operating system 108 to the input andcommands. The image may be provided through a graphical user interface(GUI) module 118. Although the GUI module 118 is depicted as a separatemodule, the instructions performing the GUI functions can be resident ordistributed in the operating system 108, the computer program 110, orimplemented with special purpose memory and processors.

In one or more embodiments, the display 122 is integrated with/into thecomputer 102 and comprises a multi-touch device having a touch sensingsurface (e.g., track pod or touch screen) with the ability to recognizethe presence of two or more points of contact with the surface. Examplesof multi-touch devices include mobile devices (e.g., iPhone™, Nexus S™,Droid™ devices, etc.), tablet computers (e.g., iPad™, HP Touchpad™),portable/handheld game/music/video player/console devices (e.g., iPodTouch™, MP3 players, Nintendo 3DS™, PlayStation PortableTM etc.), touchtables, and walls (e.g., where an image is projected through acrylicand/or glass, and the image is then backlit with LEDs).

Some or all of the operations performed by the computer 102 according tothe computer program 110 instructions may be implemented in the GPU 104Band or a combination of the CPU 104A and GPU 104B. In such anembodiment, some or all of the computer program 110 instructions may beimplemented via firmware instructions stored in a read only memory(ROM), a programmable read only memory (PROM) or flash memory within theGPU 104B or in memory 106. The GPU 104B may also be hardwired throughcircuit design to perform some or all of the operations to implement thepresent invention. Further, the GPU 104B may be a hybrid processor,which includes dedicated circuitry for performing a subset of functions,and other circuits for performing more general functions such asresponding to computer program 110 instructions. In one embodiment, theGPU 104B is an application specific integrated circuit (ASIC).

The computer 102 may also implement a compiler 112 that allows anapplication or computer program 110 written in a programming languagesuch as C, C++, Assembly, SQL, Python, Prolog, MATLAB, Ruby, Rails,Haskell, or other language to be translated into processor 104A/104Breadable code. Alternatively, the compiler 112 may be an interpreterthat executes instructions/source code directly, translates source codeinto an intermediate representation that is executed, or that executesstored precompiled code. Such source code may be written in a variety ofprogramming languages such as Java™, JavaScript™, Perl™, Basic™, etc.After completion, the application or computer program 110 accesses andmanipulates data accepted from I/O devices and stored in the memory 106of the computer 102 using the relationships and logic that weregenerated using the compiler 112.

The computer 102 also optionally comprises an external communicationdevice such as a modem, satellite link, Ethernet card, or other devicefor accepting input from, and providing output to, other computers 102.

In one embodiment, instructions implementing the operating system 108,the computer program 110, and the compiler 112 are tangibly embodied ina non-transitory computer-readable medium, e.g., data storage device120, which could include one or more fixed or removable data storagedevices, such as a zip drive, floppy disc drive 124, hard drive, CD-ROMdrive, tape drive, etc. Further, the operating system 108 and thecomputer program 110 are comprised of computer program 110 instructionswhich, when accessed, read and executed by the computer 102, cause thecomputer 102 to perform the steps necessary to implement and/or use thepresent invention or to load the program of instructions into a memory106, thus creating a special purpose data structure causing the computer102 to operate as a specially programmed computer executing the methodsteps described herein. Computer program 110 and/or operatinginstructions may also be tangibly embodied in memory 106 and/or datacommunications devices 130, thereby making a computer program product orarticle of manufacture according to the invention. As such, the terms“article of manufacture,” “program storage device,” and “computerprogram product,” as used herein, are intended to encompass a computerprogram accessible from any computer readable device or media.

Of course, those skilled in the art will recognize that any combinationof the above components, or any number of different components,peripherals, and other devices, may be used with the computer 102.

FIG. 2 schematically illustrates a typical distributed computer system200 using a network 204 to connect client computers 202 to servercomputers 206. A typical combination of resources may include a network204 comprising the Internet, LANs (local area networks), WANs (wide areanetworks), SNA (systems network architecture) networks, or the like,clients 202 that are personal computers or workstations (as set forth inFIG. 1), and servers 206 that are personal computers, workstations,minicomputers, or mainframes (as set forth in FIG. 1). However, it maybe noted that different networks such as a cellular network (e.g., GSM[global system for mobile communications] or otherwise), a satellitebased network, or any other type of network may be used to connectclients 202 and servers 206 in accordance with embodiments of theinvention.

A network 204 such as the Internet connects clients 202 to servercomputers 206. Network 204 may utilize ethernet, coaxial cable, wirelesscommunications, radio frequency (RF), etc. to connect and provide thecommunication between clients 202 and servers 206. Clients 202 mayexecute a client application or web browser and communicate with servercomputers 206 executing web servers 210. Such a web browser is typicallya program such as MICROSOFT INTERNET EXPLORER™, MOZILLA FIREFOX™,OPERA™, APPLE SAFARI™, GOOGLE CHROME™, etc. Further, the softwareexecuting on clients 202 may be downloaded from server computer 206 toclient computers 202 and installed as a plug-in or ACTIVEX™ control of aweb browser. Accordingly, clients 202 may utilize ACTIVEX™components/component object model (COM) or distributed COM (DCOM)components to provide a user interface on a display of client 202. Theweb server 210 is typically a program such as MICROSOFT'S INTERNETINFORMATION SERVER™.

Web server 210 may host an Active Server Page (ASP) or Internet ServerApplication Programming Interface (ISAPI) application 212, which may beexecuting scripts. The scripts invoke objects that execute businesslogic (referred to as business objects). The business objects thenmanipulate data in database 216 through a database management system(DBMS) 214. Alternatively, database 216 may be part of, or connecteddirectly to, client 202 instead of communicating/obtaining theinformation from database 216 across network 204. When a developerencapsulates the business functionality into objects, the system may bereferred to as a component object model (COM) system. Accordingly, thescripts executing on web server 210 (and/or application 212) invoke COMobjects that implement the business logic. Further, server 206 mayutilize MICROSOFT'S™ Transaction Server (MTS) to access required datastored in database 216 via an interface such as ADO (Active DataObjects), OLE DB (Object Linking and Embedding DataBase), or ODBC (OpenDataBase Connectivity).

Generally, these components 200-216 all comprise logic and/or data thatis embodied in/or retrievable from device, medium, signal, or carrier,e.g., a data storage device, a data communications device, a remotecomputer or device coupled to the computer via a network or via anotherdata communications device, etc. Moreover, this logic and/or data, whenread, executed, and/or interpreted, results in the steps necessary toimplement and/or use the present invention being performed.

Although the terms “user computer”, “client computer”, and/or “servercomputer” are referred to herein, it is understood that such computers202 and 206 may be interchangeable and may further include thin clientdevices with limited or full processing capabilities, portable devicessuch as cell phones, notebook computers, pocket computers, multi-touchdevices, and/or any other devices with suitable processing,communication, and input/output capability. Embodiments of the inventionare implemented as a software application on a client 202 or servercomputer 206. Further, as described above, the client 202 or servercomputer 206 may comprise a thin client device or a portable device thathas a multi-touch-based display.

Of course, those skilled in the art will recognize that any combinationof the above components, or any number of different components,peripherals, and other devices, may be used with computers 202 and 206.

Software Embodiment Introduction

Linetypes are one of the most basic aspects of 2D CAD drawings.Linetypes consist of dashes, spaces, and dots. In the prior art, as auser zooms in or zooms out, the pattern of the linetype may change(e.g., dots becomes dashes as you zoom in). It is desirable for thepattern to stay constant or to present the option for the pattern tostay constant. Such a problem is a fundamental problem encountered byanyone trying to draw a line with a linetype. To enable such a constantpattern, additional information associated with the geometry may berequired. With the increasing relevance of GPUs, including on mobiledevices, embodiments of the invention utilize GPU processing (and theability handle textures by a GPU) to enable and maintain linetypeinformation in a consistent and expected manner.

A core value of CAD applications is its 2D CAD documentation. A majorityof the documentation may consist of simply line drawing—line, polyline,and arc geometry, with basic line types. While traditional methods usingCPU based tessellation to draw linetypes are useable, the artifacts aredefinitely noticeable. The GPU provides the opportunity fordifferentiation in terms of the quality, performance and capacity forutilizing such linetypes. FIG. 3 illustrates the typical tessellationfor a simple “faceplate” part showing some “jaggies” or aliasing in theprior art. In FIG. 3, the aliasing is visible in the circles 302 thatare not smooth or consistent. In addition, further tessellation does notnecessarily improve the results. Through careful consideration of thepixel distance from the exact curve, better results may be achieved. Inthis regard, FIG. 4 illustrates the results of using analytic rendering(i.e., of the same faceplate as that of FIG. 3) from within a pixelshader of a GPU in accordance with one or more embodiments of theinvention. As can be seen, the circles 302 are smooth without aliasing.

Given the demographics of CAD users and the typical hardware/softwareconfigurations for such users, embodiments of the invention leverage GPUfunctionality (e.g., including the functionality in DX9 shader model 3level), and provide a high level of quality and functionality for basic2D rendering. Such capabilities has implications not only for the widehardware base for CAD applications, but given that mobile devices alsohave functionality and power restrictions, the techniques utilized byembodiments of the invention can be leveraged on such mobile devices aswell.

The classic geometry types in CAD applications are lines, circles,circle arcs, ellipses, and elliptical arcs. Such geometry are typicallytessellated line segments. Embodiments of the invention provide theability, for all of the geometric types, to:

(1) Create a minimal envelope on the CPU (i.e., render “envelope”geometry around the curve);

(2) Modify the envelope to the right location for the curve in thevertex shader (on the GPU); and

(3) Render the curve exactly with high quality in the pixel shader (onthe GPU) (i.e., solve the curve equation in the pixel shader).

For example—for lines, circles and arcs, a quadrilateral envelope foreach is created on the CPU and supplied to the GPU. The ellipse andelliptical arcs also get the circle envelope. The objective (in such animplementation) is to minimize work on the CPU. Accordingly, the CPUsends normalized envelope data to the GPU. The GPU vertex shader thentransforms the envelope to fit the actual curve.

FIG. 5 illustrates the processing performed by the CPU and GPU for linesin accordance with one or more embodiments of the invention. FIG. 5 maybe viewed as a pipeline from left to right with data flowing from theCPU 502 into the GPU 504. As an overview, the CPU creates canonicalgeometries in the form of the shared vertex data 506 for the instancedata 508. For example, in the first row, rectangle 510 provides thecanonical geometry for the line instance data 512. The CPU prepares thecanonical geometry 506 and sends it to the GPU 504. The GPU 504 hasmultiple processors that work in parallel and typically performprocessing in stages that includes a vertex shader 514 (that manipulates3D vertices prior to rasterization) and a pixel shader 516 (also knownas a fragment shader) (that computes color and other attributes of each“fragment” or single pixel). In the vertex shader 514, the GPU 504 usesthe canonical shared vertex data 510 and instance data 408 to transformthe endpoints/vertices, in parallel. Thus, the CPU 502 produces thecanonical geometry 506 and passes all of the different canonicalgeometry line segments simultaneously to the GPU 504. The vertex shader514 in the GPU 504 then transforms the vertices to instance data (e.g.,3D rotation for an ellipse) and the results are analytically renderedvia the pixel shader 516.

In view of the general overview of FIG. 5 set forth above, for theellipses, 3D processing is leveraged on the GPU 504 (i.e., the envelopeis transformed to the ellipse in the vertex shader 514 by performing theappropriate 3D rotation of the circle). The vertex shader 504 transformsthe end points of the envelopes to (float) screen coordinates. The zoomfactor is taken into account in computing the transformed vertices (e.g.for the line envelope) as illustrated by the following code fragment.

// Calculate right vector float2 right = float2(−forward.y, forward.x);// Extend the line envelope more as the zoom level increases to //improve pixel coverage float zoomExt = max(common.ZoomLevel, 1); //Calculate offset along forward and right vectors float forwardAmt =len * input.position.x; float rightAmt  = input.width / 2.0f *input.position.y; // Transform position to contain curve float2 position= input.p1 + forward * forwardAmt + right * rightAmt *    zoomExt;where forward is the unit vector from p1 to p2, len is the length of thevector from p1 to p2 and input.position is the canonical envelopevertex. Accordingly, the rectangle is defined from (0, −1) to (1, 1),and input positions are specified (i.e., input.position.x=0 or 1,input.position.y=−1 or 1).

The (u,v) parameters are mapped so −u is the texture parameter along theline, and v is the distance (in pixel space) from the center line. Thus,in embodiments of the invention, the (u, v) parameters are used for verydifferent purposes—the u is the position along the linetype texture, andv is used in conjunction with the falloff function to anti alias thecurve. For a line, −v directly gives the distance from the line.

For a circle of radius r given the projected circle center c, for thepixel p(x, y), the distance from the curve is easily determined as(length(p−c)−r).

To render high quality 2D, there are several components. The basiccomponent includes a continuous piece of geometry (e.g., a line,ellipse, etc.). To render such continuous geometry, an envelope that iscomputed efficiently may be utilized. Further, to achieve a goodanti-aliased effect, a falloff function may be utilized (e.g., aGuassian that empirically provides the best effect). FIG. 6 illustratesa bell-curve that is used for such anti-aliasing in accordance with oneor more embodiments of the invention.

Ellipses

Ellipses are more complex than the lines/curves illustrated in FIG. 5.To find the closest point to an ellipse is equivalent to minimizing:

e(θ)=√{square root over ((x−a cos(θ))²+(y−b sin(θ))²)}

which turns out to require a numerical method to solve to get the roots(something that cannot be implemented on a shader). Alternativeembodiments may utilize options set forth in the following papers thatare incorporate by reference herein:

-   -   Charles Loop and Jim Blinn, Resolution Independent Curve        Rendering Using Programmable Graphics Hardware, ACM Transactions        on Graphics (TOG)—Proceedings of ACM SIGGRAPH 2005, Volume 24        Issue 3, July 2005, Pages 1000-1009 (“Loop 2005”); and    -   Charles Loop and Jim Blinn, GPU Gems 3, Chapter 25—Rendering        Vector Art on the GPU, 2008 (“Loop 2008”).

Given the implicit function f for an ellipse and a point to shade p, onecan estimate the distance d(p) to the ellipse by:

${f\left( {p = \left( {x,y} \right)} \right)} = {\frac{x^{2}}{a^{2}} + \frac{y^{2}}{b^{2}} - 1}$${d(p)} = \frac{f(p)}{\left| {\nabla\; {f(p)}} \right|}$

The reference, Gabriel Taubin, Distance Approximations for RasterizingImplicit Curves, ACM Transactions on Graphics, Vol. 13, No. 1, January1994, Pages 3-42, which is incorporated by reference herein, has thederivation, which is based on an expansion of the first order Taylorseries for f, discarding the nonlinear terms, then applying thetriangular then Cauchy-Schwarz inequalities.

Loop 2005 solved the problem for rendering Bézier curves on the GPU—andthis solution can be extended to any implicit curve (including ellipsesas set forth herein).

The ddx( ) and ddy( ) intrinsic functions available in HLSL (high levelshading language for DirectX™) since DX9 provides the requiredapproximation to the gradient on the GPU in order to implement the aboveformula.

The following code fragment computes a first order approximation of thedistance from the ellipse in the pixel shader.

   // Get gradient of UVs    float2 dx = ddx(input.uv);    float2 dy =ddy(input.uv);    // Square ellipse radii    float r1_2 = input.size.x *input.size.x;    float r2_2 = input.size.y * input.size.y;    // Squareu, v    float u_2 = input.uv.x * input.uv.x;    float v_2 = input.uv.y *input.uv.y;    // Ellipse function derivative with chain rule    floatdfx = (2 * input.uv.x) / r1_2 * dx.x + (2 * input.uv.y) / r2_2 * dx.y;   float dfy = (2 * input.uv.x) / r1_2 * dy.x + (2 * input.uv.y) /r2_2 * dy.y;    // Approximate signed distance from curve with    //f(u,v) / |gradient(f(u,v))|    float sd = (u_2 / r1_2 + v_2 / r2_2 − 1)/ sqrt(dfx * dfx + dfy *    dfy);and sd—is used to determine the ellipse width, which, in conjunctionwith the falloff function, gives the required analytical antialiasedcurve. All of the computation here is done in the pixel shader 516. Thequality improvement is clearly discernible over the tessellated curve.

FIG. 7 illustrates a tessellated ellipse created in accordance with theprior art. In contrast, FIG. 8 illustrates an analytic ellipse renderedthrough shaders in accordance with one or more embodiments of theinvention (as described above).

Linestyles

Linestyles Introduction

Line stylization and line types have specific needs in engineeringdrawing. Handling corners around curves, and starts and ends of an opencurve are two key areas that need to be addressed. Embodiments of theinvention utilize the approach of texture sampling to produce linetypes.The advantages of this approach include:

-   -   1. Leveraging a highly used and basic GPU feature, namely        texture mapping, that is supported by even the most basic shader        models (including mobile); and    -   2. Low memory footprint—no need to tessellate geometry or create        intermediate tessellations.

Challenges of utilizing texture mapping to produce linetypes include:

1. Pattern alignment and parameterization. How to map texture parametersonto the texture to get the desired line pattern fidelity; and

2. Zoom variant linetypes. Implementing the classical linetype—where thedash length increases with zooming and decreases when zooming out. Thezoom invariant type, where dashes stay the same and realign, is simplerto implement using textures, since the pixel to texel scale is constant.

As background information, a texel or texture pixel is the fundamentalunit of texture space. Textures are represented by arrays of texels,just as pictures are represented by arrays of pixels. When texturing a3D surface (a process known as texture mapping), a renderer (within aGPU) maps texels to appropriate pixels in the output picture. Thetexturing process starts with a location in space and a projectorfunction is applied to the location to change the location from athree-element vector to a two-element vector with values ranging fromzero to one (uv). These values are multiplied by the resolution of thetexture to obtain the location of the texel.

Parameterization and Mapping (Encoding of the Linetype)

Embodiments of the invention use a 2D texture as an array of 1dimensional textures. In this regard, any mechanism that specifies a 1Dpattern may be utilized to define the texture/linetype patterns. In oneor more embodiments, the texture is programmatically generated from CADapplication (e.g., AutoCAD™) linetypes. The linetype specification forbasic linetypes is very simple—essentially the length is specified foreach component of the linetype pattern (e.g., dash, dot, and space).Each linetype may be defined on two lines in a linetype definition file.The first line contains the linetype name and an optional description.The second line is the code that defines that actual linetype patternand must begin with the letter A (alignment), followed by a list ofpattern descriptors that define pen-up lengths (spaces), pen-downlengths (dashes), and dots. Thus, the format of the linetype definitionis:

*linetype_name, description

A, descriptor1, descriptor2,

The following illustrates an exemplary linetype definition:

*ACAD_ISO05W100,ISO long-dash double-dot ______..______..______..

A,24,−3,0,−3,0,−3

The first line identifies the name of the linetype. In the second line,24 represents the dash (that is 24 units long), −3 the space (that is 3units long), 0 the dot (positive, negative and zero), followed byanother 3-unit long space, another dot, and a third 3-unit long space.Thus, the linetype ACAD_ISO05W100 contains a pattern consisting of: along dash, a space, a dot, a space, a dot, and a space (i.e., a longdash followed by two dots [referred to as double-dot]).

To store this linetype in a texture, a texture size width correspondingto the screen resolution is chosen (so in the 1-1 case, 1 pixel=1texel). The pattern is scaled to fit the row, based on linetype scalefactor, i.e. how many repetitions occur in a 1-1 pixel-texel case. Thismakes the mapping of linetypes to textures very efficient—essentiallyone NxL bitmap is programmatically generated from the linetypetable—where N=number of linetypes, L=texture resolution (screen width).

In terms of parameter mapping, the (u,v) parameter maps to:

u—position along the pattern

v—linetype

FIG. 9 illustrates a dialog window with an exemplary listing oflinetypes/linetype names, linetype appearances, and linetypedescriptions in accordance with one or more embodiments of theinvention.

The linetypes map into a 2D texture. FIG. 10 illustrates a DirectX™(DX™) resource viewer view of the linetypes identified in FIG. 9 inaccordance with one or more embodiments of the invention. The horizontalpattern reflects the linetype encoded in the row. FIG. 10 may alsoinclude a color variation. In this regard, the textures may berepresented using a 2 channel 32 bit float texture, (R,G), where:

R: Dash/Dot/Blank value—a float that indicates the type of element ofthe linetype. The R value is illustrated in FIG. 10 as the darker pixels1002. In an actual image, the R value may be encoded in a color (e.g.,blue).

G: The float parameter value for the center of the texel. For example,if the texture resolution is 20×2048 (20 linetypes×2048 texels) then forthe 12th pixel, the texel mid will be (1/2048)*(12+0.5)

This G channel is used to handle dots in the linetype as describedbelow. The G channel may not be used as a graphical component, butinstead, identifies the texel center that can then be used to determinethe distance of a pixel from the texel center.

Accordingly, the two channels (R,G) that are traditionally used for redand green are not used for color in embodiments of the invention.Instead, the red channel is used to determine the type of element (i.e.,whether it is a dot, dash, or a blank space), and the G value is used toidentify the center of the texel. Based on the information, thenumerical sequence/stream in the two channels is encoded into theimage/texture (e.g., as illustrated in FIG. 10) and utilized by the GPU.

Parameterization, Continuation and Polylines

While parameterization, in general, may exist in the prior art,embodiments of the invention utilization texel parameters to ensure acontinuous linetype across a line that consists of multiple linesegments/curves/arcs/etc. More specifically, as described above,linetype data is encoded into the two channels of a 2D texture andinclude elements that can be used to ensure continuity.

In particular, parameters that are numbers (i.e., representing the typeof pattern element [dash, dot, or space] and a texel center location),along with pieces of geometry are passed to the pixel shader. Suchparameters are pre-computed in such a way to achieve continuous andconsistent rendering of the pattern across different line segments.

To perform such rendering, one of the stored parameters is based on thecomposite length of the curve. For example, if a curve has a knownlength of 10 units, and a second curve starts at 2.5 with a repeatingpattern of unit length 1, when rendering the pattern, the GPU will startat 0.5 units in the pattern (i.e., 2.5 mod 1) (i.e., the GPU knows howmuch to offset into the pattern so that it looks continuous). In anotherexample, if a template is overlaid onto a curve (consisting of a lineand an arc), the GPU needs to begin the pattern at the correct location.Accordingly, if the line portion of the curve is 1.5 times the length ofthe template, the pattern in the template will end at the ½ way point.Accordingly, when the art is started, the pattern needs to commence atthe ½ way point. Thus, to ensure consistency and to begin the pattern atthe appropriate location, the cumulative length of the curve ispassed/stored as a parameter.

In view of the above, parameterization is based on length (or relativelength). Each geometry segment (line, arc, ellipse, etc.) has the startU parameter stored at creation time. In the vertex shader, depending onwhether the canonical envelope vertex coordinate is x=0 or x=1, thevertex U value is either:

(x=0 case): the cumulative length up to the beginning of thesegment—which for a stand-alone segment is 0.0, for a polyline is thecumulative length up to that vertex; or

(x=1 case): the cumulative length+the length of the segment itself.

Since parameter values are linearly interpolated, for a line, the valueat a pixel is determined by the texture R value at the interpolatedparameter. The R value defines if the pixel maps to a dot, dash orspace.

For Circles or Circle Arcs, mapping the pixel to length along the arc issimple—using s=r*theta where r is the radius and theta is the anglealong the arc in radians. This gives the length along the arc segment.

Ellipses are more difficult with respect to length basedparameterization. There is no simple formula to map a point on anelliptical arc to the length from the start of the ellipse. Accordingly,approximations are used with respect to ellipses. The distance along thecurve can be expressed as:

<x(θ), y(θ)=<a cos(θ), b sin(θ)>

<x′ ^((θ)) , y′ ^((θ)) =<−a sin(θ), b cos(θ)>

s(θ)=∫_(θ) ₀ ^(θ)√{square root over (x′(θ)² +y′(θ)²)}dθ

s(θ)=∫_(θ) ₀ ^(θ)√{square root over (a ² sin²(θ)+b ² cos²(θ))}dθ

This is the subject of elliptical integrals—which are solved bynumerical iteration techniques, which are not feasible forimplementation on a shader. The function

√{square root over (a² sin²(θ)+b ² cos²(θ))}

However is very similar to

${\frac{b - a}{2}{\cos \left( {2\theta} \right)}} + a + \frac{b - a}{2}$

the graph of which is illustrated in FIG. 11. Such a function can beintegrated easily, and offers a good approximation. Embodiments of theinvention may use such an approximation with good results.

FIG. 12 illustrates linetypes that follow the curve in accordance withone or more embodiments of the invention. As illustrated, the linetypesfollow the transitions at the tangents between different curve types.

Zoom Invariance and Zoom Variance with Dots

As used herein, zoom invariance means where the dashes on the screenkeep the same length irrespective of the zoom level. FIG. 13 illustrateszoom invariants in accordance with one or more embodiments of theinvention. Similarly, FIG. 14 illustrates zoom variants in accordancewith one or more embodiments of the invention. In FIG. 14, as the zoomlevel increases, the dashes on the screen change in length. In order toachieve zoom variant linetypes, embodiments of the invention use pointsampling. Point sampling was used instead of bilinear to avoid the needto average values of the pixels.

FIG. 15 illustrates bilinear texture sampling where there is fuzzing atthe ends of the dashes. In contrast, embodiments of the invention mayutilize point sampling to avoid fuzzing, thereby providing results suchas that illustrated in FIG. 16.

As described above, one key issue that needs to be handled is that ofdots. With point sampling, as one zooms in, a single dot in a patterncould map to an arbitrarily large number of pixels depending on the zoomfactor. This is where the G channel value was used. As illustrated inFIG. 17, a curve with a linetype pattern of a dash and two dots isillustrated. When zooming in, it is desirable to have the dots remainthe same size while the dashes increase in length as illustrated in FIG.18. To provide such functionality, a distance parameter is stored in atexel value as described in further detail below.

In general, the concept of parameterization exists in the prior art.However, embodiments of the application apply parameterization for GPUrendering. In particular, as described above, a 2 channel texture may beused to accomplish drawing “dots” for CAD linetypes so they appear asdots at any zoom level. Instead of using textures for color information,texture parameter data about the geometry of the linetype is stored(specifically, the parameter at the midline of a texel). Accordingly,for “dots” in a pattern, to handle zoom, the numeric value is the“color” that is used to acquire an estimate of the pixel distance fromthe analytic “dot”, with checks performed on the GPU in the pixelshader.

To restate the problem, if the user zooms in sufficiently, the problemis—how to determine how “far” the pixel is from the center of the texel.As a user zooms into a dot, more and more pixels on the screen will mapto a texel for the dot. If a pixel is more than a threshold distancefrom the center of the texel (and you know that the geometry beingrendered is a dot), then the pixel is not rendered (i.e., is not litup). Accordingly, only pixels within a threshold distance of the centerof a texel are actually rendered. The intrinsic functions ddx( ) andddy( ) that give the rate of change of a TEXTURE semantic pixel shaderinput, offer the solution.

When the value is the u of the (u,v) for a linetype, the ddx( ) partialdifferential approximation (or ddy( ) for that matter) give the delta_uvalue between adjacent pixels. Using delta u value, one can estimate thedistance from the texel center as illustrated in the following exemplarycode:

// Texture parameter in the center of the texel float cend =LineTexture.Sample(LineSampler, texCoords).g; // Critical test − test ifthe distance of the pixel center texture // parameter from the texelcenter is within a desired tolerance float fracp = frac(Param * scale+  Scale.x * .5f); if (fracp < cend && cend − fracp <  4 *abs(ddxuv.x) *scale)    tex = 1.0; else    tex = 0.0;

In the above code, ddxuv is the result of calling ddx( ). The xcomponent is the only one needed, since that is the delta u along thelinetype pattern in the texture. Param is the length based parameter onthe curve. Scale.x is the texel width (i.e. 1/number of texels per row).Scale is the zoom scale (or linetype scale).

The above code makes sure only 4 pixels to the left of the center texelvalue are drawn no matter what the zoom value. This logic enables thepixel shader to draw only 4 dots at the texel center no matter what thezoom level.

In view of the above, embodiments of the invention may view a texture asan image with each row in the image having two (2) channels (each pixelhas two floating point values). The first floating point value encodesinformation regarding whether the pixel is on/off. The second floatingpoint value encodes the texel center. One function of a GPU is tointerpolate a value based on texture parameters. Thus, the distancebetween the pixel center texture and the texel center is computed. Thepixel shader is then used to compare the computed distance to athreshold value to determine whether the pixel should be turned on ornot.

FIG. 19 illustrates the need to select a subset of pixels (to turn on)if zoomed in sufficiently, where more than one pixel maps to a texel. Toperform the analysis, the delta value between adjacent pixels (i.e., duwhich equals ddx( ).u) is computed. The grid 1900A and 1900B illustratesuch du computations. The grids 1900A/1900B illustrate the pixelresolution. Accordingly, the distance between two grid lines providesthe delta value between adjacent pixels.

If the zoom level provides that many pixels map to 1 texel (i.e., asillustrated at 1904), then a determination needs to be made regardingwhich pixels to light up. In contrast, if the zoom level indicates thatmany texels map to a single pixel (i.e., as illustrated at 1906), thenall of the pixels being rendered may be lit up/rendered. To determinewhich pixels to light up, the pixel is lit up if the absolute value ofthe texel parameter of the center of the texel subtracted from the uparameter for the pixel, is less than or equal to the delta valuebetween adjacent pixels multiplied by the number of pixels to light up(see 1908). In other words, if the pixel is greater than a thresholddistance from the center of the texel, then it is not lit up.

Line Segment Ends

To make sure that a curve doesn't end on an invisible segment, ends canbe handled by:

1. “Phase shifting” the texture map with an initial texture map offsetso that the first and last dashes are as centered as possible. This willrequire logic on the preprocessing (CPU) side;

2. Preprocessing the curve to split end geometry separately if 1 is notsatisfactory. This can be done on the preprocessing (CPU) side; and

3. Add logic in the pixel shader to detect end of curve conditions, andimplement end clamping programmatically using a heuristic. This can bedone in the pixel shader.

Logical Flow

FIG. 20 illustrates the logical flow for rendering a line having linestylization/a linetype pattern via the texture mapping functionality ofa graphics processing unit (GPU).

At step 2002, linetype information for a pattern of a linetype for theline is acquired.

At step 2004, the pattern is stored in a texture by encoding a type ofelement of the pattern and a texel center location. The type of elementmay be selected from a group consisting of a dash, a dot, and a space.The type of element and texel center location may be encoded as floatingpoint numbers. If the line is an ellipse, the texel center location isencoded by approximating a length based parameter using approximationtechniques.

Steps 2006-2012 are performed by a GPU to render the line. At step 2006,a distance between a pixel of the line and the texel center location iscomputed.

At step 2008, a determination is made regarding whether the distanceexceeds a threshold (e.g., a number of pixels).

At step 2010, if the distance is within the threshold, the pixel isrendered/drawn. However, if the threshold is exceeded, the pixel is notrendered/drawn at step 2012. Accordingly, the GPU is enabled to render aline based on a zoom operation while maintaining zoom invariance for adot(s) in the pattern.

Step 2010 may further include the ability to ensure consistency of thepattern across multiple line segments. Such an ability/capability isprovided based on a cumulative length of the line across multiple linesegments, a length of each of the multiple line segments (which may alsobe encoded), and a length of the pattern. In this regard, the positionalong the pattern where the type of element is located may be computed(and/or encoded with the texel center location) by the GPU to determinewhat portion of the pattern to begin rendering when each line segment isdrawn/rendered.

Conclusion

This concludes the description of the preferred embodiment of theinvention. The following describes some alternative embodiments foraccomplishing the present invention. For example, any type of computer,such as a mainframe, minicomputer, or personal computer, or computerconfiguration, such as a timesharing mainframe, local area network, orstandalone personal computer, could be used with the present invention.

The foregoing description of the preferred embodiment of the inventionhas been presented for the purposes of illustration and description. Itis not intended to be exhaustive or to limit the invention to theprecise form disclosed. Many modifications and variations are possiblein light of the above teaching. It is intended that the scope of theinvention be limited not by this detailed description, but rather by theclaims appended hereto.

What is claimed is:
 1. A computer-implemented method for rendering aline, comprising: acquiring, using a central processing unit (CPU),linetype information for a pattern of a linetype for the line, whereinthe pattern comprises a sequence of one or more elements, and whereineach of the one or more elements has an element type, and wherein eachelement type comprises a dot, a dash, or a space; selecting, using theCPU, a texture size width for a texture; scaling the pattern to fit thetexture size width; storing, using the CPU, the pattern in the textureby encoding for each of the one or more elements: (i) the element typefor the element; and (ii) a texel center location for a texel in thetexture that includes the element; passing, from the CPU to the GPU, thetexture and one or more geometry segments that define the line to berendered, wherein the one or more geometry segments include a compositelength; a graphics processing unit (GPU) rendering the line by:computing an offset based on the composite length and the texel centerlocation; and rendering the line utilizing the offset to determine abeginning point of the pattern, wherein a value of a pixel is determinedby the element type at the offset.
 2. The computer-implemented method ofclaim 1, wherein the texture size width corresponds to a screenresolution.
 3. The computer-implemented method of claim 1, wherein theelement type and the texel center location are encoded as colors.
 4. Thecomputer-implemented method of claim 1, further comprising: the CPUcreating a canonical geometry in a form of shared vertex data for theone or more geometry segments, wherein the canonical geometry comprisesa canonical envelope vertex coordinate; the CPU passing the canonicalgeometry to the GPU for rendering of the line.
 5. Thecomputer-implemented method of claim 4, further comprising: determiningthat the canonical envelope vertex coordinate comprises a x=0 quad; theCPU creating the composite length as a cumulative length up to abeginning of the one or more geometry segments.
 6. Thecomputer-implemented method of claim 4, further comprising: determiningthat the canonical envelope vertex coordinate comprises a x=1; the CPUcreating the composite length as the cumulative length plus a length ofthe one or more geometry segments.
 7. The computer-implemented method ofclaim 1, wherein: the one or more geometry segments define a circle orcircle arc; and the composite length comprises r*theta, wherein rcomprises a radius of the circle or arc, and theta comprises an anglealong the circle or arc in radians.
 8. The computer-implemented methodof claim 1, wherein: the one or more geometry segments define anellipse; and the composite length comprises an approximation computed asan integration of:${\frac{b - a}{2}{\cos \left( {2\theta} \right)}} + a + \frac{b - a}{2}$wherein a comprises a semimajor axis, b comprises a semiminor axis, andtheta (θ) comprises an angle slope.
 9. The computer-implemented methodof claim 1, wherein the type of element of the pattern and the texelcenter location are encoded as floating point values.
 10. Thecomputer-implemented method of claim 1, wherein: the GPU renders theline based on a zoom operation; the GPU maintains zoom invariancewherein the dash in the pattern is rendered a same length irrespectiveof zoom level.
 11. The computer-implemented method of claim 1, furthercomprising: utilizing point sampling that avoids fuzzing at ends of anydashes in the pattern.
 12. A system for rendering a line in a computersystem comprising: (a) a computer having a central processing unit (CPU)and a graphics processing unit (GPU); (b) the CPU: (1) acquiring, usinga central processing unit (CPU), linetype information for a pattern of alinetype for the line, wherein the pattern comprises a sequence of oneor more elements, and wherein each of the one or more elements has anelement type, and wherein each element type comprises a dot, a dash, ora space; (2) selecting a texture size width for a texture; (3) scalingthe pattern to fit the texture size width; (4) storing the pattern inthe texture by encoding for each of the one or more elements: (i) theelement type for the element; and (ii) a texel center location for atexel in the texture that includes the element; (5) passing, to the GPU,the texture and one or more geometry segments that define the line to berendered, wherein the one or more geometry segments include a compositelength; (c) the graphics processing unit (GPU) rendering the line by:(1) computing an offset based on the composite length and the texelcenter location; and (2) rendering the line utilizing the offset todetermine a beginning point of the pattern, wherein a value of a pixelis determined by the element type at the offset.
 13. The system of claim12, wherein the texture size width corresponds to a screen resolution.14. The system of claim 12, wherein the element type and the texelcenter location are encoded as colors.
 15. The system of claim 12,wherein the CPU further: creates a canonical geometry in a form ofshared vertex data for the one or more geometry segments, wherein thecanonical geometry comprises a canonical envelope vertex coordinate;passes the canonical geometry to the GPU for rendering of the line. 16.The system of claim 15, wherein the CPU further: determines that thecanonical envelope vertex coordinate comprises a x=0 quad; creates thecomposite length as a cumulative length up to a beginning of the one ormore geometry segments.
 17. The system of claim 15, wherein the CPUfurther: determines that the canonical envelope vertex coordinatecomprises a x=1; the CPU creates the composite length as the cumulativelength plus a length of the one or more geometry segments.
 18. Thesystem of claim 12, wherein: the one or more geometry segments define acircle or circle arc; and the composite length comprises r * theta,wherein r comprises a radius of the circle or arc, and theta comprisesan angle along the circle or arc in radians.
 19. The system of claim 12,wherein: the one or more geometry segments define an ellipse; and thecomposite length comprises an approximation computed as an integrationof:${\frac{b - a}{2}{\cos \left( {2\theta} \right)}} + a + \frac{b - a}{2}$wherein a comprises a semimajor axis, b comprises a semiminor axis, andtheta (θ) comprises an angle slope.
 20. The system of claim 12, whereinthe type of element of the pattern and the texel center location areencoded as floating point values.
 21. The system of claim 12, wherein:the GPU renders the line based on a zoom operation; the GPU maintainszoom invariance wherein the dash in the pattern is rendered a samelength irrespective of zoom level.
 22. The system of claim 12, whereinthe CPU or the GPU: utilize point sampling that avoids fuzzing at endsof any dashes in the pattern.